@beomki-yeo
Contributor

@beomki-yeo beomki-yeo commented Aug 30, 2025

This PR optimizes the sort_updated_tracks kernel, which uses a bitonic sort.
The changes are fairly basic: using shared memory, caching the payload variable in a register, and similar small tweaks. I could have applied warp-level primitives as in #1142, but that can be done later.

The performance improvement is minor (<1%), but still better than nothing.
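
For readers unfamiliar with the pattern, here is a minimal host-side C++ sketch of a bitonic sort over key/payload pairs. It is not the code from this PR: the names entry and bitonic_sort_block are hypothetical, the local tile vector only stands in for a shared-memory buffer, the loop over tid stands in for the threads of one block, and the local copies a and b stand in for values cached in registers. It also assumes the input size is a power of two.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical key/payload pair: `key` is the sort criterion and
// `payload` stands in for the index that is carried along with it.
struct entry {
    float key;
    unsigned int payload;
};

// Sorts `data` by ascending key with a bitonic compare-exchange network.
// Assumption: data.size() is a non-zero power of two.
void bitonic_sort_block(std::vector<entry>& data) {
    const std::size_t n = data.size();
    assert(n != 0 && (n & (n - 1)) == 0);

    // Stand-in for the shared-memory tile: load once, sort locally.
    std::vector<entry> tile(data);

    // k: length of the bitonic sequences being merged.
    for (std::size_t k = 2; k <= n; k <<= 1) {
        // j: distance between the two elements of a compare-exchange.
        for (std::size_t j = k >> 1; j > 0; j >>= 1) {
            // Each iteration stands in for one thread of the block.
            for (std::size_t tid = 0; tid < n; ++tid) {
                const std::size_t partner = tid ^ j;
                if (partner <= tid) {
                    continue;  // the lower index handles the pair
                }

                // Read both elements once into locals ("registers").
                const entry a = tile[tid];
                const entry b = tile[partner];

                // The bit of tid selected by k decides the sort direction.
                const bool ascending = (tid & k) == 0;
                const bool out_of_order =
                    ascending ? (a.key > b.key) : (a.key < b.key);

                if (out_of_order) {
                    tile[tid] = b;
                    tile[partner] = a;
                }
            }
            // In a GPU kernel a block-wide barrier would go here, so that
            // all exchanges of this step finish before the next one reads.
        }
    }

    // Write the sorted tile back to the caller's buffer.
    data = tile;
}
```

Calling bitonic_sort_block on a vector of eight entries leaves it sorted by key, with each payload still attached to its original key. In an actual kernel the inner loop disappears, since every thread runs the body for its own tid, and the steps with j smaller than the warp size are the ones that warp-level primitives could take over.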

@beomki-yeo beomki-yeo added the labels improvement (Improve an existing feature) and AI assistance (Assisted by AI) Aug 30, 2025
@beomki-yeo beomki-yeo marked this pull request as draft September 22, 2025 23:33